Nearly-Automated Metadata Hierarchy Creation

Authors

  • Emilia Stoica
  • Marti A. Hearst
Abstract

Currently, information architects create metadata category hierarchies manually. We present a nearly-automated approach for deriving such hierarchies, by converting the lexical hierarchy WordNet into a format that reflects the contents of a target information collection. We use the term “nearly-automated” because an information architect should have to make only small adjustments to produce an acceptable metadata structure. We contrast the results with an algorithm that uses lexical co-occurrence statistics.
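The abstract does not spell out how WordNet is converted, so the following is only a minimal sketch of one plausible reading: take the terms found in the collection, merge their hypernym paths into a tree, and collapse interior nodes that add no branching. The hand-written `TOY_HYPERNYMS` map is a hypothetical stand-in for WordNet, and the function names (`hypernym_path`, `build_tree`, `compress`) are assumptions made for illustration, not the authors' algorithm.

```python
from collections import defaultdict

# Hypothetical, hand-written hypernym links standing in for WordNet
# (child -> parent). Illustrative only; this is not real WordNet data.
TOY_HYPERNYMS = {
    "espresso": "coffee",
    "coffee": "beverage",
    "tea": "beverage",
    "beverage": "food",
    "bread": "baked_goods",
    "baked_goods": "food",
    "food": "entity",
}


def hypernym_path(term, hypernyms):
    """Return the chain of nodes from the root down to `term`."""
    path = [term]
    while path[-1] in hypernyms:
        path.append(hypernyms[path[-1]])
    return list(reversed(path))


def build_tree(terms, hypernyms):
    """Merge the hypernym paths of the collection's terms into a
    parent -> set-of-children mapping."""
    children = defaultdict(set)
    for term in terms:
        path = hypernym_path(term, hypernyms)
        for parent, child in zip(path, path[1:]):
            children[parent].add(child)
    return children


def compress(node, children, terms):
    """Drop interior nodes that have exactly one child and are not
    themselves terms from the collection."""
    kids = [compress(k, children, terms) for k in sorted(children.get(node, ()))]
    if len(kids) == 1 and node not in terms:
        return kids[0]
    return (node, kids)


# Terms assumed to occur in the target collection (hypothetical).
terms = {"espresso", "tea", "bread"}
tree = compress("entity", build_tree(terms, TOY_HYPERNYMS), terms)
print(tree)
```

On this toy input the single-child chains `entity`, `coffee`, and `baked_goods` are collapsed, leaving `food` as the root with `bread` and a `beverage` category beneath it. An information architect would then make the "small adjustments" the abstract mentions, such as renaming or regrouping categories.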

Similar resources

Automated Metadata in Multimedia Information Systems: Creation, Refinement, Use in Surrogates, and Evaluation

Improvements in network bandwidth along with dramatic drops in digital storage and processing costs have resulted in the explosive growth of multimedia (combinations of text, image, audio, and video) resources on the Internet and in digital repositories. A suite of computer technologies delivering speech, image, and natural language understanding can automatically derive descriptive metadata fo...

Complete metadata records in learning object repositories: some evidence and requirements

A learning object can be considered as a unit of instructional content for which a metadata record describing its characteristics and intended educational usage is provided. Metadata records can be used to develop effective search and location of learning objects, and also to develop automated or semi-automated selection and composition tools. In consequence, the quality of metadata records is ...

On the Automated Classification of Web Sites

In this paper we discuss several issues related to automated text classification of web sites. We analyze the nature of web content and metadata in relation to requirements for text features. We find that HTML metatags are a good source of text features, but are not in wide use despite their role in search engine rankings. We present an approach for targeted spidering including metadata extract...

Batch loading in metadata creation: a case study

Purpose – The purpose of this article is to describe a workflow of automated batch loading metadata from existing text to a database. Methodology/Approach – It introduces a case for the experience of metadata creation at Rutgers University Libraries in a collaborative digital project with the Hoboken Public Library in New Jersey. Findings – It is found that a well-designed workflow is crucial t...

Towards Quantifying Limits of Automated Curation of Geospatial Data

Workflow systems are an increasingly popular eScience tool for executing complex sequences of tasks. The large volumes of data created during the course of these computationally intense and data-driven scientific investigations drive research in techniques to automate metadata capture to relieve the burden on the user of manual annotation. In this paper we describe our experience to date in qua...

Publication date: 2004